AITopics | decoder part

decoder part

3e53d82a1113e3d240059a9195668edc-Paper-Conference.pdf

Neural Information Processing Systems, Feb-10-2026, 19:30:17 GMT

architecture, operation, proceedings, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Indiana > Madison County > Anderson (0.04)
Asia > China > Guangdong Province (0.04)
Asia > China > Anhui Province (0.04)

Genre: Instructional Material (0.34)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

3e53d82a1113e3d240059a9195668edc-Paper-Conference.pdf

Neural Information Processing Systems, Oct-8-2025, 12:48:22 GMT

architecture, operation, proceedings, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Indiana > Madison County > Anderson (0.04)
Asia > China > Guangdong Province (0.04)
Asia > China > Anhui Province (0.04)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion Networks

Erisen, Serdar

arXiv.org Artificial Intelligence, Feb-12-2024

Improving the efficiency of state-of-the-art methods in semantic segmentation requires overcoming the increasing computational cost as well as issues such as fusing semantic information from global and local contexts. Based on the recent success and problems that convolutional neural networks (CNNs) encounter in semantic segmentation, this research proposes an encoder-decoder architecture with a unique efficient residual network. Attention-boosting gates (AbGs) and attention-boosting modules (AbMs) are deployed to fuse the feature-based semantic information with the global context of the efficient residual network in the encoder. The decoder network, in turn, is developed with additional attention-fusion networks (AfNs) inspired by the AbMs. AfNs are designed to improve the efficiency of the one-to-one conversion of the semantic information by deploying additional convolution layers in the decoder part. Our network is tested on the challenging CamVid and Cityscapes datasets, and the proposed methods reveal significant improvements on the existing baselines, such as ResNet-50. To the best of our knowledge, the developed network, SERNet-Former, achieves state-of-the-art results (84.62% mean IoU) on the CamVid dataset and challenging results (87.35% mean IoU) on the Cityscapes validation dataset.
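
The abstract reports its results as mean IoU. As a rough illustration of how that metric is commonly computed (not the paper's evaluation code), the sketch below averages per-class intersection-over-union over two flat lists of class labels. The function name `mean_iou` and the toy labels are hypothetical, and skipping classes that appear in neither list is an assumption; benchmark toolkits may handle that case differently.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union across classes for two flat label lists."""
    ious = []
    for c in range(num_classes):
        # Pixels where both prediction and ground truth are class c.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels where either prediction or ground truth is class c.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # assumption: ignore classes absent from both lists
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three classes and six pixels (hypothetical data).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(score)
```

Here the per-class IoUs are 1/3, 2/3 and 1/2, so the mean is 0.5.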

information, segmentation, semantic information, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.48550/arXiv.2401.15741

2401.15741

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
(3 more...)

Genre: Research Report > Promising Solution (0.34)

Industry: Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Evolutionary Neural Architecture Search for Transformer in Knowledge Tracing

Yang, Shangshang, Yu, Xiaoshan, Tian, Ye, Yan, Xueming, Ma, Haiping, Zhang, Xingyi

arXiv.org Artificial Intelligence, Oct-2-2023

Knowledge tracing (KT) aims to trace students' knowledge states by predicting whether students answer exercises correctly. Despite the excellent performance of existing Transformer-based KT approaches, they are criticized for relying on manually selected input features for fusion and for using only single global context modelling, which struggles to directly capture students' forgetting behavior when the related records are distant in time from the current record. To address these issues, this paper first considers adding convolution operations to the Transformer to enhance its local context modelling ability for students' forgetting behavior, then proposes an evolutionary neural architecture search approach to automate the input feature selection and automatically determine where to apply which operation in order to balance local and global context modelling. In the search space, the original global path containing the attention module in Transformer is replaced with the sum of a global path and a local path that could contain different convolutions, and the selection of input features is also considered. To search for the best architecture, we employ an effective evolutionary algorithm to explore the search space and also suggest a search space reduction strategy to accelerate the convergence of the algorithm. Experimental results on the two largest and most challenging education datasets demonstrate the effectiveness of the architecture found by the proposed approach.

architecture, operation, proceedings, (16 more...)

arXiv.org Artificial Intelligence

2310.0118

Country:

North America > United States > Indiana > Madison County > Anderson (0.04)
Asia > China > Guangdong Province (0.04)

Genre: Research Report (0.65)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

LRDB: LSTM Raw data DNA Base-caller based on long-short term models in an active learning environment

Rezaei, Ahmad, Taheri, Mahdi, Mahani, Ali, Magierowski, Sebastian

arXiv.org Artificial Intelligence, Mar-15-2023

The first important step in extracting DNA characters is using the output data of MinION devices in the form of electrical current signals. Various cutting-edge base callers use this data to detect the DNA characters based on the input. In this paper, we discuss several shortcomings of prior base callers in the case of time-critical applications, privacy-aware design, and the problem of catastrophic forgetting. Next, we propose the LRDB model, a lightweight open-source model for private developments with a better read-identity (0.35% increase) for the target bacterial samples in the paper. We have limited the extent of training data and benefited from the transfer learning algorithm to make the active usage of the LRDB viable in critical applications. Hence, less training time is needed for adapting to new DNA samples (in our case, bacterial samples). Furthermore, LRDB can be modified according to user constraints, as the results show a negligible accuracy loss when fewer parameters are used. We have also assessed the noise-tolerance property, which shows about a 1.439% decline in accuracy for a 15 dB noise injection, and the performance metrics show that the model executes in a medium speed range compared with current cutting-edge models.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2303.08915

Country:

North America > Canada > Ontario > Toronto (0.04)
Europe > Estonia > Harju County > Tallinn (0.04)
Europe > Germany (0.04)
(2 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Satellite imagery segmentation using U-NET

#artificialintelligence, Oct-25-2022, 13:20:26 GMT

In this blog post, we perform image segmentation on a very small dataset using U-Net, a popular CNN model for segmentation. Training also uses some custom loss functions and metrics, such as dice loss and the Jaccard index. The data comes from Kaggle; the dataset is called Semantic segmentation of aerial imagery. The dataset has two sorts of files .jpg
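
The excerpt names dice loss and the Jaccard index without defining them. The sketch below shows one common formulation for a single binary mask, written in plain Python rather than the deep learning framework the blog post presumably uses. The function names, the smoothing term `eps`, and the toy masks are assumptions for illustration, not code from the post.

```python
def dice_coefficient(pred, target, eps=1e-7):
    """Dice coefficient 2|A∩B| / (|A| + |B|) for flat masks with values in [0, 1]."""
    inter = sum(p * t for p, t in zip(pred, target))
    return (2.0 * inter + eps) / (sum(pred) + sum(target) + eps)


def dice_loss(pred, target):
    """Loss that is 0 for a perfect overlap and approaches 1 for no overlap."""
    return 1.0 - dice_coefficient(pred, target)


def jaccard_index(pred, target, eps=1e-7):
    """Jaccard index |A∩B| / |A∪B|, also known as intersection-over-union."""
    inter = sum(p * t for p, t in zip(pred, target))
    union = sum(pred) + sum(target) - inter
    return (inter + eps) / (union + eps)


# Toy masks: two predicted foreground pixels, one of which is correct.
pred = [1.0, 1.0, 0.0, 0.0]
target = [1.0, 0.0, 0.0, 0.0]
d = dice_coefficient(pred, target)
j = jaccard_index(pred, target)
print(d, dice_loss(pred, target), j)
```

For these masks the dice coefficient is about 2/3 and the Jaccard index about 1/2, consistent with the identity J = D / (2 - D) that links the two measures.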

dataset, satellite imagery segmentation, segmentation, (16 more...)

#artificialintelligence

Industry: Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.97)

Memory Association Networks

Kim, Seokjun, Jang, Jaeeun, Jang, Yeonju, Choi, Seongyune, Kim, Hyeoncheol

arXiv.org Artificial Intelligence, Dec-27-2021

Various networks have been designed in the deep learning field to date. Typically, images, sounds, text, hierarchical, and relational data are learned through the networks, and inductive learning is performed. But these networks are limited to specific datasets or specific tasks. Therefore, we designed artificial association networks that can simultaneously learn various datasets in one network like humans. And in the second study, deductive association networks were proposed to perform deductive reasoning.

association network, information, vector, (13 more...)

arXiv.org Artificial Intelligence

2111.02353

Genre: Research Report (0.50)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

# 020 Overview of Semantic Segmentation methods - Master Data Science 08.11.2021

#artificialintelligence, Nov-30-2021, 12:30:43 GMT

In this post, we will see how we can use Neural Networks for the segmentation task. To be more precise, it will be about Semantic Segmentation. The goal of Semantic Segmentation is to label each pixel of an image with a corresponding class. When we start to learn Deep Learning our first experiments are tasks that are usually related to solving classification problems. We need to determine the class label of the object in the image.
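
To make "label each pixel of an image with a corresponding class" concrete, the sketch below turns a small grid of per-class scores into a label mask by taking the highest-scoring class at every pixel. The scores stand in for the output of a segmentation network; the function name `label_pixels` and the values are hypothetical.

```python
def label_pixels(scores):
    """scores[row][col] is a list of per-class scores; return the winning class per pixel."""
    return [
        [max(range(len(px)), key=px.__getitem__) for px in row]
        for row in scores
    ]


# A 2x2 "image" with two classes per pixel (hypothetical network output).
scores = [
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.6, 0.4], [0.3, 0.7]],
]
mask = label_pixels(scores)
print(mask)  # [[0, 1], [0, 1]]
```

Image classification would reduce the whole grid to one label, whereas here the output keeps the spatial layout of the input.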

convolution, output image, segmentation, (15 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Text Classification using Transformers

#artificialintelligence, Mar-18-2021, 05:55:44 GMT

In this part, we will try to understand the Encoder-Decoder architecture of the Multi-Head Self-Attention Transformer network with some code in PyTorch. There won't be any theory involved (a better theoretical version can be found here), just the bare bones of the network and how one can write this network on one's own in PyTorch. The Transformer architecture is divided into two parts: the Encoder part and the Decoder part. Several other components combine to form the Encoder and Decoder parts. Let's start with the Encoder.
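
The building block shared by the Encoder and Decoder parts is scaled dot-product attention. The sketch below is a deliberately reduced version, assuming a single head and no learned projections (queries, keys and values are all the input itself), and it is written in plain Python rather than the PyTorch used in the post so that it runs without dependencies. The names `softmax` and `self_attention` and the toy tokens are illustrative only.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x):
    """Single-head self-attention with Q = K = V = x (assumption: no learned weights)."""
    d = len(x[0])
    out = []
    for q in x:
        # Similarity of this token to every token, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in x]
        weights = softmax(scores)
        # Weighted average of all token vectors.
        out.append([sum(w * v[j] for w, v in zip(weights, x)) for j in range(d)])
    return out


tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(tokens)
print(attended)
```

A multi-head layer would run several such attention computations on separately projected copies of the input and concatenate the results.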

decoder part, encoder part, text classification, (5 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.40)

Establishing strong imputation performance of a denoising autoencoder in a wide range of missing data problems

Abiri, Najmeh, Linse, Björn, Edén, Patrik, Ohlsson, Mattias

arXiv.org Machine Learning, Apr-6-2020

Dealing with missing data in data analysis is inevitable. Although powerful imputation methods that address this problem exist, there is still much room for improvement. In this study, we examined single imputation based on deep autoencoders, motivated by the apparent success of deep learning to efficiently extract useful dataset features. We have developed a consistent framework for both training and imputation. Moreover, we benchmarked the results against state-of-the-art imputation methods on different data sizes and characteristics. The work was not limited to the one-type variable dataset; we also imputed missing data with multi-type variables, e.g., a combination of binary, categorical, and continuous attributes. To evaluate the imputation methods, we randomly corrupted the complete data, with varying degrees of corruption, and then compared the imputed and original values. In all experiments, the developed autoencoder obtained the smallest error for all ranges of initial data corruption.
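
The evaluation protocol described in the abstract (corrupt complete data at random, impute, then compare imputed and original values) can be sketched as follows. This is not the paper's autoencoder: a column-mean imputer is swapped in as a stand-in so the example stays dependency-free, and the function names, the synthetic Gaussian data, the corruption rates, and the choice of RMSE as the error measure are all assumptions.

```python
import math
import random


def corrupt(data, rate, rng):
    """Return a copy of data with roughly `rate` of the entries replaced by None."""
    return [[None if rng.random() < rate else v for v in row] for row in data]


def impute_column_mean(data):
    """Stand-in imputer: fill each None with the mean of the observed values in its column."""
    means = []
    for j in range(len(data[0])):
        observed = [row[j] for row in data if row[j] is not None]
        means.append(sum(observed) / len(observed) if observed else 0.0)
    return [[means[j] if v is None else v for j, v in enumerate(row)] for row in data]


def rmse_on_missing(original, corrupted, imputed):
    """Root-mean-square error measured only on the entries that were removed."""
    errs = [
        (o - i) ** 2
        for orow, crow, irow in zip(original, corrupted, imputed)
        for o, c, i in zip(orow, crow, irow)
        if c is None
    ]
    return math.sqrt(sum(errs) / len(errs)) if errs else 0.0


rng = random.Random(0)
complete = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(200)]
for rate in (0.1, 0.3, 0.5):  # varying degrees of corruption
    corrupted = corrupt(complete, rate, rng)
    imputed = impute_column_mean(corrupted)
    print(rate, round(rmse_on_missing(complete, corrupted, imputed), 3))
```

Any other imputer can be dropped in for `impute_column_mean` and scored by the same loop.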

autoencoder, dataset, imputation, (13 more...)

arXiv.org Machine Learning

doi: 10.1016/j.neucom.2019.07.065

2004.02584

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > Sweden > Halland County > Halmstad (0.04)
Europe > Sweden > Skåne County > Lund (0.04)
(2 more...)

Genre:

Research Report > New Finding (0.89)
Research Report > Experimental Study (0.68)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)
